NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fast and efficient template-mediated synthesis of genetic variants

https://doi.org/10.1038/s41592-023-01868-1

Liu, Liyuan; Huang, Yiming; Wang, Harris H. (June 2023, Nature Methods)

Full Text Available
Achieving strength and ductility synergy via a nanoscale superlattice precipitate in a cast Mg-Y-Zn-Er alloy

https://doi.org/10.1016/j.ijplas.2023.103558

Fan, Mingyu; Zhang, Zhongwu; Cui, Ye; Liu, Liyuan; Liu, Yingwei; Liaw, Peter K. (April 2023, International Journal of Plasticity)

Full Text Available
Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting

Dong, Chengyu; Liu, Liyuan; Shang, Jingbo (January 2022, Advances in neural information processing systems)

We show that label noise exists in adversarial training. Such label noise is due to the mismatch between the true label distribution of adversarial examples and the label inherited from clean examples – the true label distribution is distorted by the adversarial perturbation, but is neglected by the common practice that inherits labels from clean examples. Recognizing label noise sheds insights on the prevalence of robust overfitting in adversarial training, and explains its intriguing dependence on perturbation radius and data quality. Also, our label noise perspective aligns well with our observations of the epoch-wise double descent in adversarial training. Guided by our analyses, we proposed a method to automatically calibrate the label to address the label noise and robust overfitting. Our method achieves consistent performance improvements across various models and datasets without introducing new hyper-parameters or additional tuning.
more » « less
Full Text Available
Photoactive Control of Surface-Enhanced Raman Scattering with Reduced Graphene Oxide in Gas Atmosphere

https://doi.org/10.1021/acsnano.1c07695

Zhou, Lu; Pusey-Nazzaro, Lauren; Ren, Guanhua; Chen, Ligang; Liu, Liyuan; Zhang, Wentao; Yang, Li; Zhou, Jun; Han, Jiaguang (January 2022, ACS Nano)

Full Text Available
Enhanced strength-ductility synergy via novel bifunctional nano-precipitates in a high-entropy alloy

https://doi.org/10.1016/j.ijplas.2022.103235

Liu, Liyuan; Zhang, Yang; Li, Junpeng; Fan, Mingyu; Wang, Xiyu; Wu, Guangchuan; Yang, Zhongbo; Luan, Junhua; Jiao, Zengbao; Liu, Chain Tsuan; et al (June 2022, International Journal of Plasticity)

Full Text Available
UCPhrase: Unsupervised Context-aware Quality Phrase Tagging

https://doi.org/10.1145/3447548.3467397

Gu, Xiaotao; Wang, Zihan; Bi, Zhenyu; Meng, Yu; Liu, Liyuan; Han, Jiawei; Shang, Jingbo (August 2021, KDD'21:The 27th {ACM} {SIGKDD} Conference on Knowledge Discovery and Data Mining, August 14-18, 2021)
null (Ed.)
Identifying and understanding quality phrases from context is a fundamental task in text mining. The most challenging part of this task arguably lies in uncommon, emerging, and domain-specific phrases. The infrequent nature of these phrases significantly hurts the performance of phrase mining methods that rely on sufficient phrase occurrences in the input corpus. Context-aware tagging models, though not restricted by frequency, heavily rely on domain experts for either massive sentence-level gold labels or handcrafted gazetteers. In this work, we propose UCPhrase, a novel unsupervised context-aware quality phrase tagger. Specifically, we induce high-quality phrase spans as silver labels from consistently co-occurring word sequences within each document. Compared with typical context-agnostic distant supervision based on existing knowledge bases (KBs), our silver labels root deeply in the input domain and context, thus having unique advantages in preserving contextual completeness and capturing emerging, out-of-KB phrases. Training a conventional neural tagger based on silver labels usually faces the risk of overfitting phrase surface names. Alternatively, we observe that the contextualized attention maps generated from a Transformer-based neural language model effectively reveal the connections between words in a surface-agnostic way. Therefore, we pair such attention maps with the silver labels to train a lightweight span prediction model, which can be applied to new input to recognize (unseen) quality phrases regardless of their surface names or frequency. Thorough experiments on various tasks and datasets, including corpus-level phrase ranking, document-level keyphrase extraction, and sentence-level phrase tagging, demonstrate the superiority of our design over state-of-the-art pre-trained, unsupervised, and distantly supervised methods.
more » « less
Full Text Available
Cryo-EM structure of the SARS-CoV-2 Omicron spike

https://doi.org/10.1016/j.celrep.2022.110428

Cerutti, Gabriele; Guo, Yicheng; Liu, Lihong; Liu, Liyuan; Zhang, Zhening; Luo, Yang; Huang, Yiming; Wang, Harris H.; Ho, David D.; Sheng, Zizhang; et al (March 2022, Cell Reports)

Full Text Available
Graph Clustering with Embedding Propagation

https://doi.org/10.1109/BigData50022.2020.9378031

Yang, Carl; Liu, Liyuan; Liu, Mengxiong; Wang, Zongyi; Zhang, Chao; Han, Jiawei (December 2020, BigData'20: IEEE 2020 Int. Conf. on Big Data, Dec. 2020)
null (Ed.)
In the past decade, the amount of attributed network data has skyrocketed, and the problem of identifying their underlying group structures has received significant attention. By leveraging both attribute and link information, recent state-of-the-art network clustering methods have achieved significant improvements on relatively clean datasets. However, the noisy nature of real-world attributed networks has long been overlooked, which leads to degraded performance facing missing or inaccurate attributes and links. In this work, we overcome such weaknesses by marrying the strengths of clustering and embedding on attributed networks. Specifically, we propose GRACE (GRAph Clustering with Embedding propagation), to simultaneously learn network representations and identify network clusters in an end-to-end manner. It employs deep denoise autoencoders to generate robust network embeddings from node attributes, propagates the embeddings in the network to capture node interactions, and detects clusters based on the stable state of embedding propagation. To provide more insight, we further analyze GRACE in a theoretical manner and find its underlying connections with two canonical approaches for network modeling. Extensive experiments on six real-world attributed networks demonstrate the superiority of GRACE over various baselines from the state-of-the-art. Remarkably, GRACE improves the averaged performance of the strongest baseline from 0.43 to 0.52, yielding a 21% relative improvement. Controlled experiments and case studies further verify our intuitions and demonstrate the ability of GRACE to handle noisy information in real-world attributed networks.
more » « less
Full Text Available
Joint Aspect-Sentiment Analysis with Minimal User Guidance

https://doi.org/10.1145/3397271.3401179

Zhuang, Honglei; Guo, Fang; Zhang, Chao; Liu, Liyuan; Han, Jiawei (July 2020, Proceedings of the 43rd International {ACM} {SIGIR} conference on research and development in Information Retrieval, {SIGIR} 2020, July 25-30, 2020)
null (Ed.)
Full Text Available
NetTaxo: Automated Topic Taxonomy Construction from Text-Rich Network

https://doi.org/10.1145/3366423.3380259

Shang, Jingbo; Zhang, Xinyang; Liu, Liyuan; Li, Sha; Han, Jiawei (April 2020, WWW '20: The Web Conference 2020)

The automated construction of topic taxonomies can benefit numerous applications, including web search, recommendation, and knowledge discovery. One of the major advantages of automatic taxonomy construction is the ability to capture corpus-specific information and adapt to different scenarios. To better reflect the characteristics of a corpus, we take the meta-data of documents into consideration and view the corpus as a text-rich network. In this paper, we propose NetTaxo, a novel automatic topic taxonomy construction framework, which goes beyond the existing paradigm and allows text data to collaborate with network structure. Specifically, we learn term embeddings from both text and network as contexts. Network motifs are adopted to capture appropriate network contexts. We conduct an instance-level selection for motifs, which further refines term embedding according to the granularity and semantics of each taxonomy node. Clustering is then applied to obtain sub-topics under a taxonomy node. Extensive experiments on two real-world datasets demonstrate the superiority of our method over the state-of-the-art, and further verify the effectiveness and importance of instance-level motif selection.
more » « less
Full Text Available

« Prev Next »

Search for: All records